# 1B Parameter Fine-tuning
Minivla Wrist Vq Libero90 Prismatic
MIT
MiniVLA is a vision-language-action model for robotics that supports multimodal tasks taking image and text input and producing text output.
Image-to-Text
Transformers English

Stanford-ILIAD
18
0
Minivla Libero90 Prismatic
MIT
MiniVLA is a 1-billion-parameter vision-language model compatible with the Prismatic Vision-Language Model codebase, suitable for robotics and multimodal tasks.
Image-to-Text
Transformers English

Stanford-ILIAD
127
0
Featured Recommended AI Models